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INTRODUCTION 


The work reported on here was undertaken to improve, update 
and fully document the Santa Clara Valley Water District's 
regional regression equations. The regional equations are an 
important part of the District's methodology for determining 
design flood flows and their frequency of occurrence in Santa 
Clara County. 

Over the past ten years the District has utilized regional 
regression and correlation techniques to estimate design flood 
flows and their frequency of occurrence. In its endeavors to 
provide the best available methods, the District updates these 
regional regression equations periodically as more data becomes 
available (usually every five years). The year 1976 was the year 
during which this updating was done. Timely with that, the 
District convened a panel of expert hydrologists to review its 
methods. The panel reviewed, in their report (Review of Basic 
Hydrology Methods for Flood Control), the current methods and 
recommended short- and long-term improvements. This study also 
incorporates the panel's recommendations. 

The report discusses: 

1. The available basic streamflow data and associated 
problems. 

2. The stochastic generation of data and statistical 
frequency analysis. 

3. The determination of the physical and hydrometeorological 
characteristics of the watersheds. 

4. The multiple liner regression analysis used to determine 
the regional ^regional regression equations, and 
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5. The application and limitations of the developed 
equations. 
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RESULTS 


This study has improved the predictive capability of the 
regional regression equations. With the inclusion of new sta¬ 
tions and up-to-date records, the new equations become more 
reliable and applicable over a wider range of conditions. There 
are separate regression equations for mean and a which together 
define flow frequency regime. 

The mean can be estimated fairly well from a limited amount 
of data so no large changes were expected nor were found in this 
investigation. 

The greatest progress made in this study was in the develop¬ 
ment of the regression equation for the standard deviation. The 
coefficient of determination improved substantially with a very 
small standard error (.065). 

The regression constant was reduced and greater weight was 
put on the Mean Annual Precipitation and less on the area, a fact 
which made these equations more sensitive to the variability in 
the physical and climatological conditions. 

In this study, equivalent years of record at all utilized 
stations were based on a 61-year regionally-correlated data 
matrix. The use of the 20-station, 61-year data matrix is 
responsible for the improvement in defining the flow variability. 
The correlation and extension of each station's data provided a 
more uniform base upon which to perform regressions. 

Although the regional equations are most useful on ungaged 
natural watersheds, they may also be used to augment information 
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on watersheds where streamflow data is available. However, in 
situations where the regional equations are applied to gaged 
watersheds and don't concur, considerable additional comprehen¬ 
sive analysis of all available records must be undertaken. The 
objectives in this regard are to obtain the best possible esti¬ 
mate of the design flows. Thus, the results of the regional 
equations are to be checked and compared with data in or around 
the specific watershed being studied and are to be modified, if 
necessary, in those cases where good record exists, to more 
closely reflect the information inherent in the historic record. 

It must be pointed out, however, that the exercise of con¬ 
siderable judgement by an experienced and knowledgeable hydrolo¬ 
gist is essential to the practice of hydrology. Naive applica¬ 
tion of the methods reported on here could lead to erroneous 
results. 
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PHYSICAL SETTING 


Santa Clara County is located within the west-central coast 
ranges of California south of San Francisco Bay. The coast range 
landscape is characterized by a series of northwesterly trending, 
sub-parallel mountain ranges and intervening large and small 
valleys. Santa Clara Valley, which extends 50 miles southwest¬ 
erly through Santa Clara County into San Benito County, is a 
complex down-dropped structural trough. This alluvium-filled 
valley is bounded on the northeast by Diablo Range and on the 
southwest by Santa Cruz Mountains. The width of the valley 
varies from about 14 miles in the northern portion to less than 
one mile at the midpoint, then to about 5 miles in the southern 
portion of the county before widening again in San Benito County. 

Santa Clara Valley is separated into two surface water 
drainage areas (north and south) by a subtle topographic divide 
in the alluvium near Morgan Hill. 

Elevation of the floor of the valley ranges from sea level 
at San Francisco Bay to about 425 feet at the apex of the al¬ 
luvial fan which forms the drainage divide, and 140 feet at the 
basin outlet of South Santa Clara Valley. 

The east slopes of the Santa Cruz Mountains gradually rises 
from the foothills to moderately steep, rugged ridges to a peak 
elevation of 3798 feet at Loma Prieta. 

The western slopes of the Diablo Range are neither as rugged 
nor as steep as the Santa Cruz Mountains but rise to a peak eleva¬ 
tion of 4372 feet at Copernicus peak. 
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The mean annual precipitation in the County varies with 
elevation, from 14 inches in the valley floor to 56 inches in the 
high point of Loma Prieta. Most of the rain occurs between 
November and April. Storms are usually of the cyclonic type, 
with fronts usually covering the whole County at one time. 
Convective storms with heavy central rain and limited spacial 
distribution occur rarely. It is the activity of some cells 
within the cyclonic storms that cause the relatively varied 
distribution of rainfall in time and space. 

Snow rarely occurs, but when it does it is usually very 
light and melts off rapidly. Occasionally significant snow falls 
accrue in the surrounding mountains staying for up to several 
weeks. However, snowmelt does not contribute significantly to 
flood flows. 

Topographic features of the County and slope characteristics, 
which more strongly affect land use, and surface runoff are 
illustrated in Figure 1. 
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STOCHASTIC GENERATION AND STATISTICAL ANALYSIS OF DATA 


Basic Data 

Streamflow data were gathered from stations within and out¬ 
side the District. During the process of extracting the peaks 
and the annual 1, 2, 3, 4, 5, 10, and 15-day maximum volumes, the 
data were screened for reliability and consistency. Figure 1 
shows the location of these stations while Figure 2 shows the 


available record. Gut or 
considered for this study, 
operated by the District. 


i.1_n 4- T . -i- r n-v -i 1 tr 

•cne 3/ bldLlUllb uiictu weit: pj- xiuci.i. _l-L._ y 


24 are U.S.G.S. and 13 are owned and 


Most of the U.S.G.S. stations in Santa Clara County and 
some from adjacent counties were used in this study. The stations 
in nearby counties were selected on the basis of apparent similar¬ 
ities in physiographic characteristics. 

Most District stations were initially considered for the 
analysis. Stations downstream of reservoirs or some other regu¬ 
lating facilities were removed from consideration, since inclusion 
would impose nonhomogeniety of data in the statistical computa¬ 
tions. The remaining stations were reviewed for length of record 
and an arbitrary ten-year minimum was selected as a limiting 
criterion. 

Also, only one station per watershed was used in an effort 
to prevent unnatural weighting toward a specific type of stream- 
flow pattern or distribution. 

The stations used for the initial analysis are listed in 
Table 1. 
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In general, the U.S.G.S. streamflow stations were more 
reliable than the District stations. The correlation between the 
District's and U.S.G.S. stations were poor at times. 

Screening the obvious bad data points out of the record 
improved the correlation coefficients greatly as will be shown 
later. 

For the years 1926-1960, U.S.G.S. streamflow data were 
obtained from "Geological Survey Water Supply Papers, Surface 
Water Supply of the United States". After that, 1961-1975, the 
data were obtained from "Water Resources Data for California, 

Part I. Surface Water Records". The annual peaks and the 1, 2, 
3, 4, 5, 10, and 15-day volumes were extracted. 

District streamflow data were obtained from the District's 
records. As with the U.S.G.S. stations, the annual peaks and 
volumes were also extracted. 

A list of all the data together with the time and date of 
occurrence is available in Appendix A. The inconsistencies that 
were discovered and corrected in the basic data are also listed 
in the same Appendix. 
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TABLE 1 



U.S.G.S. or 

Location 

Station Name 

District Number 

Numbers 


Arroyo Seco 

1520 

1 

Cedar Creek 

1529 

2 

Pacheco Creek * 

1530 

3 

Uvas above Uvas 

1539 

4 

Bodfish Creek 

1541 

5 

Uvas Creek 

1540 

6 

Corralitos Creek 

1591 

7 

Aptos Creek 

1597 

8 

West Br. Sequel 

1598 

9 

Soquel 

1600 

10 

Zayante 

1603 • 

11 

San Lorenzo River 

1605 

12 

Branciforte 

1615 

13 

Colma a S. F. 

1627 

14 

Redwood Creek 

1623 

15 

San Francisquito 

1645 

16 

Madatero Creek 

1660 

17 

Los Gatos 

1680 

18 

Saratoga 

1695 

19 

Coyote-Gilroy 

1698 

20 

Coyote-Madrone 

1700 

21 

Upper Penitencia 

1721 

22 

Dry Creek 

1805 

23 

San Lorenzo 

1810 

24 

Fisher Cr. @ Monterey 

10 

25 

Road 

Ross Cr. @ Blossom 

21 

26 

Hill 

« 


Smith's Creek 

27 

27 

San Tomas Creek 

29 

28 

Wildcat Cr. @ Sp 

30 

29 

Calabazas Cr. @ 

31 

30 

Rainbow 

Permanente Cr. near Hclley 

32 

31 

Hale Cr. near Magdalena 

33 

32 

Permanente Cr. 

34 

33 

Metcalf Cr. above Coyote 

40 

34 

Canal Extension 

Los Animas Cr. @ 

56 

35 

O'Connell Ranch 

Packwood Cr. @ Jackson 

57 

36 . 

Ranch 

Little Arthur Creek 

65 

37 
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Data Analysis 


This section discusses the systematic screening of data and 
the logical grouping of stations in order to improve correlations. 
The streamflow data, i.e., annual peaks and volumes for various 
durations, were analyzed by the use of the U.S. Army Corps of 
Engineers' Hydrologic Engineering Center (HEC) program titled 
"Regional Frequency Computation". 

This program performs frequency computations of hydrologic 
events utilizing the information from adjacent stations. 

Frequency statistics are computed for recorded events at each 
station and for each duration. Statistics for each station are 
then adjusted to the complete record of selected base stations 
in accordance with the procedure given in "Statistical Methods 
in Hydrology" by Leo R. Beard, January 1962. Missing events 
are estimated by the use of cross-correlation matrices of groups 
of stations, so that complete sets of events are obtained for all 
years at all stations while preserving all intercorrelations. 

The adopted statistics of estimated and observed data at all 
stations were used in the multiple linear regression and cor¬ 
relation. A listing of the stations together with their record 
lengths is shown on Figure 2. 

Grouping 

The method of computation in the previously mentioned HEC 
program requires that a maximum of only ten stations be grouped 
together for each run. Since 37 stations were initially consid¬ 
ered, it was not possible to cross-correlate all stations together 
with all durations (i.e.. Peak 1, 2, 3, 4, 5, 10, and 15-day 
volumes). Therefore, it was necessary to "group" the stations 
in some logical fashion. 
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The initial criteria for grouping stations were based on 
their geographical proximities. Preliminary grouping was done 
by visual inspection after first plotting the stations on a 
topographic map,(Figure 1). 

Grouping was also necessary since Arroyo Seco near Soledad 
has the longest record. Hence, all the stations could be overly 
affected by it. The grouping process was used to alleviate the 
possible excessive weighing of any one station. 

In this grouping a matrix of cross correlations was created. 
The objectives were to obtain the groups that, together, would 
yield the best cross correlation coefficient. 

The following chapter discusses the several computer trial 
runs needed to obtain the best grouping. 

Analysis of Computer Output 

The criterion selected for determining the "goodness" of 
a particular station's grouping was the correlation coefficients 
matrices of the recorded and estimated data. Each correlation 
coefficient matrix and grouping for each run was carefully re¬ 
viewed before any adjustments were made. Additionally, relative 
information ratios as well as the basic data were considered in 
the evaluation of a streamflow station. 

The stations were arranged in runs and groups. Each run 
contained many groups of stations. These groups were arranged 
in such a way as to enable station statistics and data to be 
saved and used again for subsequent groups. This procedure 
was utilized to maintain consistency in data transfer and to 
tie the stations together. 
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The groups were arbitrarily numbered from 10 to 16. After 
reviewing the results of each group, subsequent runs were made 
with the modifications deemed necessary, and were labeled run 
1, 2, 3, 4, 5, and run 6, respectively. 

In the preliminary runs, only 50 years of data were used 
for the study. A dimension limit in the program prohibited more 
than 50 years of data with 8 durations (i.e., 400 data points). 
Extension of the necessary dimensions required a somewhat complex 
procedure that was not attempted. Only one station, Arroyo Seco, 
had an additional 20 years of record. However, for the final 
runs it was decided to include 11 of these 20 additional years 
at Arroyo Seco. Therefore, the number of durations investigated 
was reduced. 

Visual inspection of the correlation matrix for some of the 
groupings showed that a few correlation coefficients were quite 
low. However, an analysis considering the relative information 
ratio as described in "Use of Correlation to Improve Estimates 
of the Mean and Variance" by Myron B. Fiering, U.S.G.S. Professional 
Paper 434-C, demonstrated that the information gained by esti¬ 
mating additional years of record exceeded the information loss 
due to the decreased correlation coefficients even at places 
where the correlations are relatively low. This being the case, 
it was considered appropriate to leave in some of the stations 
even though they produced low correlation coefficients. 

For some unknown reason very large negative skew coefficients 
exist within the data. Low and high outliers were looked into 
according to the method described in "A Guideline for Determining 
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Flood Flow Frequencies", U.S. Water Resources Council. The 
largest flow for each station that could be classed as a low 
outlier was determined. (This procedure does not consider zero 
flows as low outliers.) Comparison of these flows with the recor¬ 
ded data (Table 2) revealed that there were no "low outliers" 
present. 

Based on the correlation coefficient matrixes and considering 
the extra effort required to realize possible small improvements 
for running additional groupings, the runs were terminated and 
the obtained statistics (dependent variables) from run 6 were 
adopted and are listed on Table 3. Appendix B is the complete 
printout of all the basic data together with the various correla¬ 
tion coefficient matrixes. The estimated data and the various 
steps needed to obtain the adopted statistics are clearly shown. 
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PEAK 


TABLE 2 


ANALYSIS OF OUTLIERS FOR PEAKS _ 

_ Low Outliers 

w/generalized G = -0.60_Recorded G_N Years_o'_X_Limit 


Arroyo Secco 

11152000 

-0.575 

50 

0.378 

3.877 

205.1 

Cedar Creek 

11152900 

-1.700 

14 

0.759 

2.564 

1.1 

Pacheco Creek 

11153000 

-1.492 

36 

0.922 

3.060 

0.3 

Uvas (above Uvas) 

11153900 

-1.579 

14 

0.303 

3.384 

239.2 

Bodfish Creek 

11154100 

-0.411 

16 

0.479 

2.455 

6.7 

Uvas Creek 

11154000 

-0.392 

27 

0.402 

3.375 

74.3 

Corralitos Creek 

11159150 

-0.920 

15 

0.538 

2.639 

6.8 

Aptos Creek 

11159700 

-1.125 

17 

0.583 

2.616 

4.1 

West Branch Soquel 

11159800 

-0.820 

14 

0.532 

2.956 

15.5 

Soquel 

11160000 

-0.930 

24 

0.477 

3.368 

41.6 

Zayante 

11160300 

-1.253 

18 

0.587 

2.903 

7.3 

San Lorenzo River 

11160500 

-0.344 

23 

0.450 

3.775 

137.3 

Branciforte 

11161500 

-0.206 

19 

0.381 

3.173 

68.2 

Colma (@ San Francisco)11162720 

0.936 

11 

0.182 

3.073 

4.5 

Redwood Creek 

11162800 

-1.610 

16 

0.403 

2.310 

8.7 

San Francisquito 

11164500 

-2.384 

35 

0.828 

2.943 

0.5 

Matadero Creek 

11166000 

-2.518 

23 

0.743 

2.322 

0.4 

Los Gatos 

11168000 

-0.566 

15 

0.441 

3.261 

60.0 

Saratoga 

11169500 

0.254 

42 

0.379 

2.621 

14,040.5 * 

Coyote-Gilroy 

11169800 

-1.676 

15 

0.612 

3.334 

18.9 

Coyote-Madrone 

11170000 

-1.499 

10 

0.522 

3.449 

67.7 

Upper Penitencia 

11172100 






Dry Creek 

11180500 

-1.295 

16 

0.787 

2.181 

0.3 

San Lorenzo 

11181000 

-0.472 

29 

0.546 

2.973 

8.0 

Fisher Creek 

10 

-0.684 

13 

0.477 

2.426 

7.4 

Ross Creek 

21 

-0.393 

29 

0.403 

2.177 

4.5 

Smith's Creek 

27 






San Tomas Creek 

29 

-2.433 

26 

0.691 

2.089 

0.3 

Wildcat Creek 

30 

-0.837 

29 

0.433 

2.513 

7.5 

Calabazas Creek 

31 

0.191 

27 

0.413 

2.453 

9,963.0* 

Upper Permanente Creek 32 

-0.308 

25 

0.330 

1.815 

4.0 

Magdalena (Hale) Creek 33 

-2.424 

30 

0.821 

1.853 

0.1 

Permanente Creek 

34 

-1.790 

19 

0.824 

2.202 

0.2 

Metcalf Creek 

40 

-0.199 

18 

0.505 

0.720 

0.1 

Los Animas Creek 

56 

-1.646 

13 

0.596 

2.480 

3.4 

Packwood Creek 

57 

-0.603 

12 

0.766 

1.785 

0.2 


Little Arthur Creek 65 
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TABLE 3 


REGIONAL REGRESSION ANALYSIS 
ADOPTED STREAM GAGE STATISTICS FOR 
PEAK FLOWS 




D .A. 

M 

< r 

1% 

CSM * 

1. 

Aptos Creek 

12 .2 

2 .621 

.536 

4,252 

349 

2. 

Bodfish Creek 

7 .4 

2 .448 

.488 

2,319 

313 

3 . 

Branciforte Creek 

17 .3 

3 .130 

.357 

6,326 

365 

4. 

Calabazas Creek 

SCVWD #31 

4.73 

2 .479 

.449 

2,104 

445 

5 . 

Cedar Creek 

12 .8 

2 .355 

.896 

10,951 

855 

6 . 

Corralitos Creek 

10 .6 

2 .615 

.477 

3,249 

306 

7 . 

Coyote Creek 
near Gilroy 

109 .0 

3 .365 

.560 

26,169 

240 

8 . 

Coyote Creek 
at Madrone 

196 .0 

3 .478 

.809 

99,751 

508 

9 . 

Dry Creek 

9.41 

2 .195 

.766 

4,315 

458 

10. 

Fisher Creek 

SCVWD #10 

15 .5 

2.263 

.663 

3,231 

208 

11. 

Hale Creek 

SCVWD #33 

2 .8 

1.903 

.789 

2,433 

869 

12 . 

Los Animas Creek 

SCVWD #56 

37 .0 

2.343 

.729 

5,170 

139 

13 . 

Los Gatos Creek 

38 .6 

3 .268 

.421 

11,467 

297 

11. 

Matadero Creek 

7 .26 

2 .456 

.469 

2,176 

300 

15 . 

Metcalf Creek 

SCVWD #40 

.80 

.684 

.510 

43 

55 

16. 

Pacheco Creek 

146 .0 

2 .971 

.917 

49,540 

339 

17 . 

Packwood Creek 

SCVWD #57 

10 .1 

1 .652 

.910 

2,305 

228 

18. 

Permanente Creek 

SCVWD #34 

15 .9 

2.296 

.699 

4,074 

256 

19. 

Redwood Creek 

1.82 

2 .291 

.451 

1,376 

756 

20. 

Ross Creek 

SCVWD #21 

2 .86 

2.188 

.413 

921 

322 

21. 

San-Francisquito Creek 

37 .5 

3 .044 

.551 

12,019 

320 

22 . 

San Lorenzo Creek 
at Haywood 

37 .5 

2 .987 

.579 

11,899 

317 

23 . 

San Lorenzo River 

111.0 

3 .726 

.400 

30,060 

270 

24. 

San Tomas Creek 

SCVWD #29 

3 .25 

2 .175 

.716 

3,319 

1021 

25 . 

Saratoga Creek 
at Saratoga 

9.2 

i n 

2 .598 

.364 

1,915 

208 



TABLE 3 (continued) 




D .A. 

M 

cr 

1% 

CSM * 

26 . 

Soquel Creek 
at Soquel 

40 .2 

3.295 

.411 

11,686 

290 

27 . 

Upper Permanente 
SCVWD #32 

8.2 

1.870 

.336 

317 

38 

28 . 

Uvas Creek at 

Morgan Hill 

30 .3 

3 .391 

.348 

11,097 

366 

29. 

Uvas Creek above 
Uvas Reservoir 

21.0 

3 .318 

.364 

10,053 

478 

30. 

West Branch 

Soquel Creek 

12 .2 

2 .974 

.471 

7,235. 

593 

31. 

Wildcat Creek 

SCVWD #30 

1.15 

2 .502 

.454 

2,267 

1971 

32 . 

Zayante Creek 

11 .1 

2 .825 

.546 

7,103 

639 


*Cubic feet per second per square mile 
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Regional Regression Analysis 


In the previous chapter, statistics (mean, standard deviation 
and 1% flows) were obtained for gaged watersheds. These sta¬ 
tistics contain information available from all gaged watersheds 
as well as representing information transferable to the region. 

In this chapter, these statistics are used to develop equations 
to determine design floods and their frequency of occurrence. 

The mathematical model used was the Corps of Engineers, HEC Program 
Multiple Linear Regression, January 1975. 

In general, multiple linear regression determines the relation 
ships between one dependent and two or more independent variables. 
In this case the dependent variables for the various equations are 
the statistics of recorded data, determined previously, while the 
independent variables are watershed climatological or physical 
parameters described below. 

Independent Variables 

The rate of runoff from a basin is a function of its hydro- 
logic and physical characteristics. To describe the relation 
between design flows and basin characteristics, it was first 
necessary to determine which of the hydrologic characteristics 
are significantly related to the statistics of flood peaks and 
volumes. The choice of independent variables was limited to those 
readily obtainable. Thirteen of the geomorphic and climatic varia¬ 
bles that were considered in this analysis as independent variables 
are described below. The actual values were obtained and are 
listed on Table 4. 

Area, A - The area planimetered from a U.S.G.S. 7-1/2 minute 
quad map in square miles. 
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M.A.P., P - The mean annual basin precipitation determined 
from the District's 1972 Mean Annual Precipitation map (M.A.P. )• For 
stations outside the limits of the District's map, the M.A.P. 
was estimated from the U.S.G.S. map developed for all of 
California. 

Bifurcation Ratio, Rb - The average of the ratios of the 
number of stream segments of a given order to the number of stream 
segments of the next higher order. Stream orders were determined 
from designated flow lines on U.S.G.S. 7-1/2' quad maps. 

Length of Main Stream, Lm - Length of longest flow path from 
stream gage location to furthest distant point on the drainage 
divide (in miles). 

Shape Factor, d/Lm - The average of the distances from the 
points on a uniform grid (50-70 points over the basin) to the 
stream gage location (as measured along the flow path) divided 
by Lm, the length of the main stream. 

Effective Basin Width, Wb - Total Area A divided by length 
of main stream Lm. 

Slope of Main Stream, Sm - Average slope of the main stream 

in Ft/Ft. Profile plotted from U.S.G.S. 7-1/2 ' quad maps. 

Total Stream Lengths, Ls - Total length (in miles) of all 

the stream paths designated on the U.S.G.S. 7-1/2 1 quad maps. 

Drainage Density, DP - Total stream lengths, Ls divided by 
the drainage area, A. 

Distance to Center of Area, Lea - Distance (in miles) from 
the stream gage to the center of area of the drainage basin 
measured along the stream path to a line drawn through the center 
of area perpendicular to the stream. 
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Overland Flow, d - An estimate of the length of the overland 
path (in miles) defined as the reciprocal of twice the drainage 
density, DD. 

Basin Factor, LLS - It is determined from a combination of 

three measured parameters: LmLca. 

S 1/2 
m 

Average Basin Slope, So - From a uniform grid of 50-70 points 
per basin the slope at a grid point is the average of two sides of 
the grid square. The average of all the slopes at each grid point 
is the average basin slope. So in ft/ft. Most easily calculated by 
using contour interval and counting contour lines. 

Regression and Correlation 

In this analysis it is important to recognize the differ¬ 
ences between regression and correlation. 

Regression is the mathematical method that yields equations 
of one dependent and one or more independent variables. Cor¬ 
relation is the process by which the degree of association 
between these variables is defined. The degree of association 
is a measure of how good the equations are. 

Correlation theory requires that the data of the two pro¬ 
cesses to be correlated are normally distributed, although this 
condition should not limit the application of that theory. 

McDonald (1957) reported that the "non-normality is of inconse¬ 
quential magnitude physically". The other condition of correla¬ 
tion is the assumption that the processes are free from measure¬ 
ment errors. Regression requires the following conditions in 
addition to what is required for correlation. 
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TABLE 4 


REGIONAL REGRESSION ANALYSIS 
BASIN PARAMETERS 


STATION 

AREA 

A 

M.A.P. 

P 

BIFURCATION 

RATIO 

Rb 

LENGTH 

OF MAIN 

STREAM 

Lm 

SHAPE 

FACTOR 

d/Lm 

EFFECTIVE 

BASIN 

WIDTH 

Wb 

SLOPE 

OF MAIN 

STREAM 

Sm 

TOTAL 

STREAM 

LENGTHS 

Ls 

DRAINAGE 

DENSITY 

DD 

DISTANCE 

TO CENTER 

OF AREA 

Lea 

OVERLAND 

FLOW 

d 

LraLca 

Sn, 1/2 

AVERAGE 

BASIN 

SLOPE 

So 


Miles 2 

Inches 


Miles 

Ft/Ft 

Miles 

Ft/Ft 

Miles 

Miles/Mile 2 

Miles 

Miles 

LLs 

Ft/Ft 


1159700 


Aptos Creek 

12.2 

36 

4.00 

9.09 

.550 

1.34 

1154100 

Bodfish Creek 

7.4 

30 

3.33 

5.68 

.455 

1.30 

1161500 

hr:.;.riforte Creek 

17.3 

40 

3.50 

9.09 

. 558 

1.90 

scvwn #31 

Calabazas Creek 

1152900 

Cedar Creek near 

4.73 

24 

3.2 

5.11 



Bel^l Station 

12.8 

21 

2.83 

8.14 

.518 

1.57 

1159150 

Corralitos Creek 

10. 6 

40 . 

4.25 

5.76 

.602 

1.84 

1169800 

Coyote Creek 
near Gilroy 

109.0 

24 

3.68 

28.41 

.440 

3.84 

1170000 







Coyote Creek 
at Madrone 

196.0 

22 

4.17 • 

41.29 

.451 

4.75 


,0235 

21.93 

1.80 

4.74 

. 278 

280.8 

.2 390 

.0327 

14.58 

1.97 

1.89 

.254 

59.4 

.2412 

,0172 

31.63 

1.83 

4.74 

.274 

32R. 3 

— 

. 0278 



2.73 


83. 6 


0271 

27.27 

2.13 

3.98 

.235 

196.8 

. 2376 

0395 

20.8 

1.97 

3.03 

.254 

87.8 

— 

.0141 

182.58 

1.68 

10.04 

.299 

2405.0 

— 

.0098 

343.56 

1.75 

10.98 

. 285 

4575.8 





TABLE 4 (CONTINUED) 


REGIONAL REGRESSION ANALYSIS 
BASIN PARAMETERS 






LENGTH 


EFFECTIVE 

SLOPE 

TOTAL 


DISTANCE 



AVERAGE 




BIFURCATION 

OF MAIN 

SHAPE 

BASIN 

OF MAIN 

STREAM 

DRAINAGE 

TO CENTER 

OVERLAND 

LmLca 

BASIN 


AREA 

M.A.P. . 

RATIO 

STREAM 

FACTOR 

WIDTH 

STREAM 

LENGTHS 

DENSITY 

OF AREA 

FLOW 

Srn 1 / 2 

SLOPE 

STATION 

A 

P 

Rb 

Lm 

d/Lm 

Wb 

Sm 

Ls 

DD 

Lea 

d 

So 


2 

Miles 

Inches 


Miles 

Ft/Ft 

Miles 

Ft/Ft 

Mi les 

Miles/Mile 

Mi les 

Miles 

LLs 

Ft/Ft 

1igQCnn 

Dry Creer. '? 

Union City 

9.41 

19 












SCVWD #10 

Fisher Creek 

15.5 

20 

2.23 

8.14 

- 

1.90 

.0025 

23.5 

1.52 

3.79 

.330 

617 .0 

- 

SCVWE #33 














Hale Creek 

2.8 

19 

4 

2.12 

.565 

1.32 

. 0308 

5.11 

1.83 

1.00 

.273 

12.1 

.1040 

SCVWD =56 

Los Anitas Creek 

37.0 

20 

3.80 

14.39 

.546' 

2.57 

.0146 

89.20 

2.41 

6.06 

.207 

721.6 

- 

1168000 

Lop Gatos Creek 

38.6 

43 

4.11 

- 12.88 

.410 

3.00 

.0260 

82.58 

2.14 

6.44 

. .234 

514.0 

.2810 

1166000 

Matadero Creek 

7.26 

19 

7 

5.91 

.573 

1.23 

.0138 

13.03 

1.79 

3.22 

.279 

162.0 

.1008 

SCVWD #40 
.Metcalf Creek 

.80 

17 

2 

2.05 

- 

. 390 

.1074 

2.65 

3.31 

1.04 

.151 

6.5 

- 

1153000 

Facheco Creek 

146.0 

21 












SCVWD #57 

Packwood Creek 

10.1 

20 

4.67 

7.01 

.633 

1.44 

.0319 

22.35 

2.21 

4.36 

.226 

171.2 

- 



TABLE 4 (CONTINUED) 


REGIONAL REGRESSION ANALYSIS 
BASIN PARAMETERS 


STATION 

AREA 

A 

M.A.P. 

P 

BIFURCATION 

RATIO 

Rb 

LENGTH 

OF MAIN 

STREAM 

Lm 

SHAPE 

FACTOR 

d/Lm 

EFFECTIVE 

BASIN 

WIDTH 

Wb 

SLOPE 

OF MAIN 

STREAM 

Sm 

TOTAL 

STREAM 

LENGTHS 

Ls 

DRAINAGE 

DENSITY 

DD 

DISTANCE 

TO CENTER 

OF AREA 

Lea 

OVERLAND 

FLOW 

d 

LmLca 

Sm 1/2 

AVERAGE 

BASIN 

SLOPE 

So 


Miles^ 

Inches 


Miles 

Ft/Ft 

Miles 

Ft/Ft 

Miles 

2 

Miles/Mile 

Miles 

Miles 

LLs 

Ft/Ft 

1172100 

P-rnit jncia Creek 

21.1 

21.5 

3.06 

11.93 

. 555 

1.77 

.0311 

41.29 

1.96 

10.61 

.256 

717.6 

. 2479 

SCVWD #34 
Ptrnanente 

15.9 

71 

4.13 

9.85 

.483 

1.61 

.0226 

28.22 

1.77 

4.73 

.282 

309.9 

— 

11628000 

Kc -V.'ood Creek 

1.82 

9 

2.00 

2.08 

.630 

0.87 

.0236 

2.46 

1 . 35 

1.14 

.370 

15.4 

.1233 

SCVWD #21 

Ross Creek 

2.86 

25 

3 

1.89 

.600 

1.51 

.0360 

3.79 

1.33 

.83 

. 376 

8.3 

.1669 

1164 600 

Spn Francisquito 
Creek 

37.5 

31 

3.71 

11.74 

. 578 

3.19 

.0129 

73.86 

1.97 

5.87 

.254 

606,7 


1181000 

San Lorenzo 

Creek at Hayward 

37.5 

24 

4.10 

8.79 

.6353 

4.27 

.0152 

80. 30 

2.14 

4.55 

.2 34 

324.1 

.2168 

1160500 

San Lorenzo 

River 

111.0 

•48 

3.66 

21.21 

.5379 

5.23 

.0071 

135.04 

1.22 

10.04 

.411 

2519.6 

_ 

SCVWD #29 

San Tomas Creek 

3.64 

29 

2.00 

5.42 



.0385 



2.73 
















TABLE 4 (CONTINUED) 
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REGIONAL REGRESSION ANALYSIS 
BASIN PARAMETERS 


STATION 

AREA 

A 

M.A.P. 

P 

BIFURCATION 

RATIO 

Rb 

LENGTH 

OF MAIN 

STREAM 

Lm 

SHAPE 

FACTOR 

d/Lm 

EFFECTIVE 

BASIN 

WIDTH 

Wb 

SLOPE 

OF MAIN 

STREAM 

Sm 

TOTAL 

STREAM 

LENGTHS 

Ls 

DRAINAGE 

DENSITY 

DD 

DISTANCE 

TO CENTER 

OF AREA 

Lea 

OVERLAND 

FLOW 

d 

LmLca 

Sm 1/2 

AVERAGE 

BASIN 

SLOPE 

So 


2 

Miles 

Inches 


Miles 

Ft/Ft 

Miles 

Ft/Ft 

Miles 

2 

Miles/Mile 

Miles 

Miles 

LLs 

Ft/Ft 

1169500 

Saratoga Creek 
at Saratoga 

9.2 

42.0 

2.85 

5.11 

.610 

1.80 

.0500 

9.47 

1.03 

2.84 

.485 

64.9 

.2765 

1160000 

Sequel Creek 
at Soqooi 

40.2 

40 

3.31 

15.91 

.437 

2.53 

.015 

62.92 

1.57 

8.90 

. 320 

1156.2 

— 

SCVWD *32 

Upper Perranente 

8.2 

24 

3.67 

7.01 

.515 

1.17 

.0354 

18.37 

2.24 

3.60 

.223 

134.0 

.2376 

1154000 

Uvas Creek at 
Morgan Hill 

30. 3 

37.4 

4.02 

10.98 

.515 

2.76 

.0207 

59.09 

1.95 

5.34 

.256 

407.6 

.2448 

1153900 

Uvas Creek above 

Uvas Reservoir 

21.0 

39.0 

3.83 

8.90 

.767 

2.36 

.0234 

43.18 

2.06 

3.79 

.24 3 

220.5 

.2353 

1159800 

West Branch 

Soquel Creek 

12.2 

45 

2.50 

9.28 

.499 

1.32 

.0224 

16.89 

1.82 

3.98 

.275 

246.8 

.244 

SCVWD #30 

Wildcat Creek 

4.12 

2- 

2.50 

4.73 



.0 320 



2.58 


68.2 


1160300 

Zayante Creek 

11.1 

50 

4.00 

6.82 

.6165 

1.63 

.0236 

19.89 

1.79 

3.41 

.279 

151.4 



1 



1. The deviations of the dependent variable about the 
regression line are normally distributed. 

2. Values of dependent variables are uncorrelated. 

3. The variables are homogenious, i.e., variables should 
be measured the same way, and statistically independent. The 
assumption of normality is required for the direct application 
of the standard error theories in statistics and the computation 
of confidence limits and tests of significance. 

It is evident that the hydrologic data used in regression 
and correlation cannot, by any means, satisfy all of these 
assumptions. Therefore, the resultant regression equations 
should be used under the implied assumption, that personal 
experience in interpreting the results should be utilized as 
a major input. 

For these reasons, and in order to limit and guide the 
personal judgement a confidence band (as will be discussed 
later) should always be associated with the results of regression 
operations. 

The log transformation of the dependent variables (Table 3) 
and the independent variables (Table 4) was used in the regres¬ 
sion. Although this transformation could distort basic infor¬ 
mation, it was thought that the benefits of the log transforma¬ 
tion as a process of normalizing the data and linearizing the 
regression model to provide equal deviation about the regression 
line ,far exceed the small loss of information. 

Stepwise multiple linear regression is a satisfactory 
technique for eliminating variables of less significance, if 
the independent variables are carefully selected on a logical 
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basis. The use of too many variables reduces the degrees of 
freedom, and, hence, the correlation coefficient. Difficult 
to measure and relatively unimportant independent variables 
together with variables of less significance were not considered 
in the final results of the regression. 

The following is the form of the general regression equation 

Log Y = a log A + b log B + c + — + k 

Where Y is the dependent variables 

A,B,C- are independent variables, 

a,b,c- are regression coefficients and 

k is the regression constant. 

The regression equation of the above form is derived by 
least squares fitting, the result of which is an establishment 
of values of regression constants and coefficients in such a 
way that the determination coefficients are maximized and the 
standard errors minimized. 

Elimination of independent variables by the stepwise 
method was done based on the smallest partial determination 
coefficient. 

Review of the Procedures 

Among other statistics, the results of the HEC regression 
program consists of the following information (reference is made 

to Appendix C): 

1. Simple correlation coefficients between parts of in¬ 
dependent variables. One of the practical requisites in 
multiple-regression analysis is that the various independent 
variables not be highly correlated amongst themselves. It is 
therefore, the intention of this study to choose the variables 
in such a way that this measure is minimized. 
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2. Partial determination coefficient. It is the measure 
used to eliminate the least significant independent variable in 
the stepwise process. 

3. Regression coefficient for each independent variable. 

4. Regression Constant. 

5. Multiple determination coefficient both actual and 
adjusted. The adjustment is used to remove the bias introduced 
by less degress of freedom. The maximization of this function 
is one measure used in the final choice of the independent 
variables. 

6. The standard error is the last measure, the minimiza¬ 
tion of which is used in the choice of the independent variables 

The HEC regression program eliminates the least significant 
independent variable in the computed regression equation, then 
proceeds to recompute successive equations without that variable 
This procedure was repeated until only one independent variable 
remains. This made it possible to determine when no significant 
improvement in the results was obtained by the use of additional 
variables. 

Normally, the independent varicibles, drainage area and 
precipitation, explain most of the explainable variation in 
flood magnitude. Other variables should be used only when they 
improve results by an amount that justifies the necessary data 
collection. 

Of the 13 original variables, three were eliminated im¬ 
mediately because the numerical determination of them was 
difficult and preliminary regression runs indicated they did 
not significantly improve results. They are; So, average basin 
slope; d, overland flow; and DD, drainage density. 
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Preliminary Selection of Stations and Their Respective 
Significant Physical Parameters 

The analysis first considered the statistics of 32 gaging 
stations and thirteen independent variables for each station. 

It was anticipated that, as the study progressed, some of these 
stations and most of those parameters would be reduced. This 
chapter discusses these processes and the final choice of 
stations, parameters, and regression equations. 

A few of the stream gage stations were eliminated immediately 
as follows; Dry Creek, Pacheco Creek, San Tomas and Wildcat 
Creek; each had poor data. As previously mentioned, drainage 
density (DD), overland flow (d) and the average basin slope 
(So) were immediately eliminated because of their interdependence 
and the difficulty to compute. The parameter LLS was found to 
adequately combine all the physical features represented by these 
parameters. 

Regression for the Mean, M 

In the initial regression runs, 25 stations and 9 inde¬ 
pendent variables were used with the following results for 
the Mean: 


RUN 

#1 



Rb, 

d/Lm, 



R 

Square 

Standard 

Error 

Sm, 

P, 

A, 

Lea, 

Ls, 

Wb, Lm 

.880 

.244 

Sm, 

P, 

A, 

Lea, 

Rb, 

d/Lm, 

Ls, 

Wb 

.880 

.237 

Sm, 

P, 

A, 

Lea, 

Rb, 

d/Lm, 

Ls 


.880 

.230 

Sm, 

P, 

A, 

Lea, 

Rb, 

d/Lm 



.880 

.224 

Sm, 

P, 

A, 

Lea, 

Rb 




.879 

.218 

Sm, 

P, 

A, 

Lea 





.873 

.218 

Sm, 

P, 

A 






.835 

.242 

S.m, 

P 







.780 

.274 

Sm 








.412 

.437 


Simple Correlation Coefficients 


Variable 

Area 

Map 

Rb 

Lm 

d/Lm 

Area 

1.0000 

0.1362 

0.1706 

0.9609 

-0.3820 

Map 

0.1362 

1.0000 

0.0000 

0.0900 

0.0000 

Rb 

0.1706 

0.0000 

1.0000 

0.1730 

0.0000 

Lm 

0.9609 

0.0900 

0.1730 

1.0000 

-0.4761 

d/Lm 

-0.3820 

0.0000 

0.0000 

-0.4761 

1.0000 

Wb 

0.9076 

0.1330 

0.0946 

0.7545 

-0.1265 

Sm 

-0.7126 

0.0000 

-0.1354 

-0.6715 

0.1122 

Ls 

0.9841 

0.0000 

0.2750 

0.9550 

-0.3704 

Lea 

0.9254 

0.1854 

0.2262 

0.9679 

-0.3994 

Mean 

0.7556 

0.5802 

0.0000 

0.6765 

-0.1762 

Variable 

Wb 

Sm 

Ls 

Lea 

Mean 

Area 

0.9076 

-0.7126 

0.9841 

0.9254 

0.7556 

Map 

0.1330 

0.0000 

0.0000 

0.1854 

0.5802 

Rb 

0.0946 

-0.1354 

0.2750 

0.2262 

0.0000 

Lm 

0.7545 

-0.6715 

0.9550 

0.9679 

0.6765 

d/Lm 

-0.1265 

0.1122 

-0.3704 

-0.3994 

-0.1762 

Wb 

1.0000 

-0.6618 

0.8788 

0.7185 

0.7576 

Sm 

-0.6618 

1.0000 

-0.6818 

-0.6467 

-0.6218 

Ls 

0.8788 

-0.6818 

1.0000 

0.9220 

0.6871 

Lea 

0.7185 

-0.6467 

0.9220 

1.0000 

0.6426 

Mean 

0.7576 

-0.6218 

0.6871 

0.6426 

1.0000 


A second trial run eliminated Lea, Lm, and Sm in favor of the 
combined factor (LLS) . The results for the Mean , M were: 


RUN 

t-2 



R 

Square 

Standard 

Error 

Wb, 

P, 

LLS, 

Ls, Rb, d/Lm, A 

.823 

.280 

Wb, 

P, 

LLS, 

Ls, Rb, d/Lm 

.823 

.272 

Wb, 

P, 

LLS, 

Ls, Rb 

.821 

.265 

Wb, 

P, 

LLS, 

Ls 

.815 

.263 

Wb, 

P, 

LLS 


.799 

.268 

Wb, 

P 



.777 

.276 

Wb 




.592 

.365 


It is obvious from these two trials that the shape factor 
d/Lm is of minor importance and being somewhat difficult to 
compute was therefore discarded. The high correlation of the 
four "size" parameters (Lm, Wb, Ls, Lea) with Area caused the 
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reordering in the second trial and the reduction of importance 
of the single Area factor. In the third trial therefore these 
four factors were eliminated with the following results: 


ruu #3 

R 

Square 

Standard 

Error 

Sm, p, A, LLS, Rb 

.874 

.222 

Sm, P, A, LLS 

.867 

.223 

Sm, P, A 

.835 

.242 

Sm, P 

.780 

.274 

Sm 

.412 

.437 

removing LLS as a factor 

R 

Square 

Standard 

Error 

Sm, P, A, Rb 

.848 

.239 

Sm, P, A, 

.835 

.242 

Sm, P 

.780 

.274 

Sm 

.412 

.437 


There are a few streamflow stations whose records were con¬ 
sidered to be questionable. In an effort to increase the reliability 
of the regression analysis these were systematically removed until 
an optimum was reached. The results with 20 stations were: 





R 

Standard 




Square 

Error 

A, 

P, 

Sm, Rb 

.886 

.173 

A, 

P, 

Sm 

.874 

.17 6 

A, 

P, 


.842 

.191 

A 



.756 

.230 


It is obvious from these results that the inclusion of other 
variables in addition to A and P causes little improvement 
in the coefficient of determination or the standard error. It was 
decided, therefore, that the final regression equation of the means 
should be based on the area and the mean annual precipitation. 
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Regression for the Standard Deviation 

For determination of the independent variables for the 
Standard Deviation, the same procedure of Runs #1 to #3 as for the 
Mean was followed. The results for the Standard Deivation were: 


RUN #1 





R 

Square 

Standard 

Error 

P, 

Ls, 

Sm, 

A, Lm, d/Lm, 

Lea, 

Wb, Rb 

.456 

.123 

P, 

Ls, 

Sm, 

A, Lm, d/Lm, 

Lea, 

Wb 

.455 

.119 

P, 

Ls, 

Sm, 

A, Lm, d/Lm, 

Lea 


.454 

.116 

P, 

Ls, 

Sm, 

A, Lm, d/Lm 



.452 

.113 

P, 

Ls, 

Sm, 

A, Lm 



.449 

.110 

P, 

Ls, 

Sm, 

A 



.445 

.108 

P, 

Ls, 

Sm 




.442 

.106 

P, 

Ls 





.429 

.104 

P 






.365 

.108 

RUN #2 





R 

Square 

Standard 

Error 

P, 

Ls, 

Wb, 

LLS, d/Lm, A, 

Rb 


.453 

.116 

P, 

Ls, 

Wb, 

LLS, d/Lm, A 



.453 

.113 

P, 

Ls, 

Wb, 

LLS, d/Lm 



.449 

.110 

P, 

Ls, 

Wb, 

LLS 



.448 

.108 

P, 

Ls, 

Wb 




.441 

.106 

P, 

Ls 





.429 

.104 

P 






.365 

.108 

RUN #3 





R 

Square 

Standard 

Error 

P, 

LLS 

, Sm 

, A, Rb 



.439 

.111 

P, 

LLS 

, Sm 

, A 



.438 

.108 

P, 

LLS 

, Sm 




.437 

.106 

P, 

LLS 





.420 

.105 

P 






.365 

.108 
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In an effort to increase the reliability of this regression 
analysis, five streamflow stations with suspect data were removed. 
The results of this analysis for the remaining 20 stations with 


only the three 

parameters A, P, and 

LLS were: 




R 

Standard 



Square 

Error 

P, LLS, A 


.425 

.081 

P, LLS 


.425 

.086 

P 


.263 

.078 

The final 

regression equations 

for the Standard Deviation 

should therefore, be based on the mean annual 

precipitation, the 


basin factor and Area. 

Regression for the 1% Peak Directly 

A separate regression analysis was made using the 1% peak 
value as the dependent variable. Much of the District's work re¬ 
quires only the 1% peak value so that a regression of the 1% peak 
directly might give a better estimate than the mean and standard de¬ 
viation. Proceeding as for the previous analysis, the results were: 


RUN 

#1 






R 

Standard 








Square 

Error 

Ls, 

Sm, 

Rb, 

Lea, 

P» 

Lm, 

Wb, d/Lm, A 

.821 

.300 

Ls, 

Sm, 

Rb, 

Lea, 

P, 

Lm, 

Wb, d/Lm 

.820 

.290 

Ls, 

Sm, 

Rb, 

Lea, 

P, 

Lm, 

Wb 

.819 

.281 

Ls, 

Sm, 

Rb, 

Lea, 

P, 

Lm 


.817 

.273 

Ls, 

Sm, 

Rb, 

Lea, 

P 



.811 

.269 

Ls, 

Sm, 

Rb, 

Lea 




.800 

.269 

Ls, 

Sm, 

Rb 





.788 

.269 

Ls, 

Sm 






.761 

.279 

Ls 







.723 

.293 
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RUN #2 


R Standard 


A, Rb, 

d/Lm, Wb, P, LLS, 


Square 

Error 

Ls 

.774 

.313 

A, Rb, 

d/Lm, Wb, P, LLS 


.774 

.304 

A, Rb, 

d/Lm, Wb, P 


.773 

.295 

A, Rb, 

d/Lm, Wb 


.772 

.287 

A, Rb, 

d/Lm 


.770 

.281 

A, Rb 



.763 

.278 

A 



.754 

.276 

RUN #3 



R 

Standard 

A, Sm, 

LLS, Rb, P 


Square 

Error 


.789 

.272 

A, Sm, 

LLS, Rb 


.783 

.269 

A, Sm, 

LLS 


.772 

.269 

A, Sm 



.756 

.272 

A 



.735 

.277 

Removing questionable streamflow 

data and 

using the same 20 

stations as for the Mean 

and Standard 

Deviation 

the results were 




R 

Standard 




Square 

Error 

A, Sm, 

P 


. 929 

.144 

A, Sm 



.926 

.142 

A 



.917 

.146 


Therefore, the regression equation for the 1% peak flow should be 


based on the area, the slope and the mean annual precipitation. 

The additional improvement resulting from the use of the 
Mean Annual Precipitation, P was small.. However, the Mean Annual 


Precipitation was tested separately without Sm, because A & P were 
the only two factors used in calculating the Mean, M. 


A, P 
A 


R 

Standard 

Square 

Error 

.917 

.150 

.917 

.146 
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In each case (i.e. in the case of the mean, standard deviation 
and 1% peak), the exclusion of several questionable stream gages 
greatly improved the reliability of the regression analysis. The 
independent variables also arranged themselves in a much more 
logical fashion. This improvement illustrates the importance of 
good data. It is absolutely necessary that each station be care¬ 
fully scrutinized. 
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Determination of Final Regression Equations 
The primary independent variables input to the regression 
equations were determined in the preliminary analysis described 
in the previous section. To further eliminate the impact of 
questionable data, different combinations of stations were 
tested separately, adding or excluding stations which were most 
suspect. During the course of this work, it was noticed that 
standard errors of regression could be improved by eliminating 
some stations and adding others. 

To the 20 stations used in the previous analysis, three 
SCVWD stream gages Fisher Creek, Metcalf Creek, and Calabazas 
Creek were added for a total of 23 stations. This was done 
in order to stress the applicability of regression to areas 
similar to those stations, although it is known that the data 
of these stations is questionable. The following regional 
equations and their measure of significance resulted: 


Equation 




R 

Square 

Standard 

Error 

(EQN #1) 

M = 

-.338 + .85 log A + 

1.41 log p 

. 850 

.260 

(EQN #2) 

a = 

1.45 P - - 42 LLS'° 6 


.468 

.074 

(EQN #3) 

1% = 

19.4 A -95 P" 58 Sm“' 

25 

.904 

.218 


The Coyote-Madrone station was suspect because of its very 
high standard deviation, so it was removed from the group of 23 
stations and the resulting equations were: 



(EQN #4) M = -.300 + .87 log A + 1.38 log P 
(EQN #5) o = 1.29 P - - 36 LLS'° 4 
(EQN #6) 1% = 7.36 A* 83 P’ 85 Sm"' 33 



. 841 
.352 


. 896 


Standard 

Error 

.267 

.072 

.210 
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The Zayante Creek station was eliminated next because of 
its high standard deviation, so that there were only 21 stations 


in this 

analysis. The results for these 21 

stations 

were: 



R 

Standard 

Equation 


Square 

Error 

(EQN #7) 

M = -.415 + .85 log A + 1.47 log P 

.843 

.272 

(EQN #8) 

a = 1.67 p"* 46 LLS 

.507 

.063 

(EQN #9) 

1% = 11.19 A* 86 P' 73 Snf' 30 

. 898 

.213 


It was very desirable to keep the .8 square mile Metcalf 
Creek station because the equations are most often applied to 
small areas. However, there was some question of the reliability 
of the data, so the station was removed. With the remaining 20 
stations the final results were: 


Equation 



R 

Square 

Standard 

Error 

(EQN #10) 

M = .148 

+ .68 log A + 1.25 log P 

.817 

.210 

(EQN #11) 

o = 1.65 

P - * 45 LLS* 05 

.502 

.065 

(EQN #12)' 

1% = 238 

A' 84 P- 23 

.918 

. 128 


The regression coefficients are tending toward values which 
generally appear to be reasonable and acceptable. Therefore, 
further adjustments were not attempted. Equations 10, 11, and 12 
have the best combination of multiple determination coefficients 
and standard errors, and therefore were accepted for the peaks. 

Table 5 compares the results of these regression equations 
with the gage statistics for each of the 20 stations used in 
the development. The mean and standard deviation for the gage 
data are the "adopted" statistics from the final HEC Regional 
Frequency Analysis Program. The -.6 skew is input as a regional 
constant (this is more fully described in the next chapter). 
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The other skew values included for comparison are from the 

recorded and estimated data of the regional frequency program. 
Similiar analyses were conducted for the volumes and are re¬ 
viewed in Appendix D of this report. All the resultant equa¬ 
tions are summarized in Table 6. 
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TABLE 5 


COMPARISON OF RESULTS OF EQUATIONS 
PEAK 

Statistics from Recorded 

and Estimated Data M 



Station 


Regional 

a (EQN# 1 


Adopted 

Skew 


1) 

APTOS 




D . A . = 12.2 

M= 2.621 


M=2.832 


M.A.P. = 36 

0 - .536 


0= .436 


LLS = 280.8 

g=- . 37 

g=- . 6 

1%= 4484 


Sm = .0235 

1%=5310 

1%=4252 


2) 

BODFISH CREEK 





D . A . = 7.4 

M=2 . 448 


M=2.585 


M . A.P . = 30 

o= .488 


CT= .438 


LLS =59.4 

g=- . 23 

g=- . 6 

1%=2561 


Sm = .0327 

1%=3177 

1%=2319 


3) 

BRANCIFORTE CREEK 





D. A. = 17.3 

M= 3.130 


M=2.992 


M.A.P. = 40 

c= .357 


C= .419 


LLS = 328.3 

g=- . 42 

g=- . 6 

1%=6021 


Sm = .0172 

1%=7156 

1%=6326 


4) 

CALABAZAS CREEK 





D . A. = 4.73 

M= 2.479 


M=2.332 


M.A.P. = 24 

0= .449 


o= .493 


LLS = 83.6 

g=-.14 

g=- .6 

1%= 1014 


Sm = .0278 

1%=3022 

1%=2104 


5) 

CORRALITOS CREEK 





D . A. = 10.6 

M=2.615 


M=2.848 


M.A.P. = 40 

a= .477 


a= .392 | 


t,t,,c = 37.8 

g=- . 30 

g=- . 6 

1%=38 4 5 J 


Sm = .0395 

1%=4182 

1%=324 9 


6) 

COYOTE CREEK 





near Gilroy 





D.A. = 109.0 

M= 3.365 


M=3.259 


M.A.P. = 24 

o= .560 


a= .583 


LLS = 2405.0 

g=-.77 

g=-. 6 

1%=22,648 


Sm = 0.0141 

1%=22,418 

1%=26,169 


7) 

FISHER CREEK 





D.A. = 15.5 

M=2.263 


M=2.584 


M.A.P. = 20 

O— .663 


a= .591 


LLS = 617.0 

g=-.43 

g= -. 6 

1%=4955 


Sm = .0025 

1%=3941 

1 % = 3 2 31 



Equations 
1 % 


1 %-- 


1 %= 


1 %= 


1 %= 


1 %= 


1 9-- 

-L - 


(EQN# 2) 


4352 


2738 


5987 


1737 


3954 


25,004 


4690 
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TABLE 5 (Continued) 


COMPARISON OF RESULTS OF EQUATIONS 
PEAK 


Station 


Statistics from Recorded 
and Estimated Data 
Regional 

Adopted Skew 


8) LOS GATOS CREEK 

D.A. = 38.6 
M.A.P. = 43 
LLS = 514.0 
Sm = .0260 

9) MATADERO CREEK 

D.A. = 7.26 
M.A.P. = 19 

LLS = 162.0 
Sm = .0138 

10) PERMANENTE CREEK 

D.A. = 15.9 
M.A.P. = 21 
LLS = 309.9 
Sm = .0226 

11) REDWOOD CREEK 

D.A. = 1.82 
M.A.P. = 19 
LLS = 15.4 
Sm = .0236 


12) ROSS CREEK 

D.A. = 2.86 
M.A.P. = 25 
LLS = 8.3 

Sm = .0360 

13) SAN FRANCISQUITO 

D.A. = 37.5 
M.A.P. = 31 
LLS = 606.7 
Sm = .0129 


M=3.268 
0= .421 
g=-1.40 
1%=6663 

g=-. 6 
1%=11,467 

M=2.456 
o= .469 
g=-.92 
1%=1716 

g=-. 6 
1%=2176 

M= 2.296 
0= .699 
g=-l.47 
1%=1527 

g=-6 
1%=407 4 

M= 2.2 91 
o= .451 
g=-l.09 
1%=967 

g=-. 6 
1%=137 6 


M=3.044 
0= .551 

g=-1.03 g=-.6 ' 

1%=8 000 1%=12,019 


M (EQN#10) 1 % (EQN# 2) 

a (EQN# 11) 

(q=-.6) _ 


M= 3.269 
0= .415 

1%=11,199 1%=11,926 


M=2.332 
a= .566 

1%=248 9 1%=2434 


M= 2.618 
a= .559 

1%=4666 1%=4803 


M=l.923 
a= .503 

1%= 739 i%=7 60 


M=2.206 
o= .431 

1%=1038 l%=1181 


M=3.083 
o= ,485 

1%=9,880 1%=10,826 


M= 2.188 
o= .413 

g= -.15 g=-.6 

1%=1273 1%=921 
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TABLE 5 (Continued) 


COMPARISON OF RESULTS OF EQUATIONS 
PEAK 

Statistics from Recorded 

and Estimated Data M 


14) 


16) 


17 ) 


18 ) 


19 ) 


20 ) 


Station 

Adopted 

Regional 

Skew 

O (EQN# 11 
(q=—. 6) 

SAN LORENZO CREEK 



at Hayward 

D.A. = 37.5 

M=2.987 


M=2.944 

M.A.P. = 24 

0= .579 


0= .527 

LLS = 324.1 

g=-.7 2 

g=-. 6 

1%=8605 

Sm = .0152 

1%=10,695 

1%=11,899 


SAN LORENZO RIVER 

D.A. = 111.0 

M= 3.726 


M= 3.640 

M.A.P. = 48 

a= .400 


0= .428 

LLS = 2519.6 

g=-.57 

g=-. 6 

1%- 27,838 

Sm = .0071 

1%=31,189 

1%=30,060 


SARATOGA CREEK 

D.A. = 9.2 

M=2.598 


M= 2.832 

M.A.P. = 42 

a= .364 


a= .378 

LLS =64.9 

g=+.06 

g=-. 6 

1%=3488 

Sm = 0.0500 

1%=2888 

1%=1915 


SOQUEL CREEK 

D.A. = 40.2 

M= 3.295 


M= 3.241 

M.A.P. = 40 

a= .411 


o= .446 

LLS =1156.2 

g=-.53 

g=-. 6 

1%=12,008 

Sm = .015 

1%=12,370 

1%=11,686 


UVAS CREEK 
at Morgan Hill 

D.A. = 30.3 

M=3.391 


M= 3.121 

M.A.P. = 37.4 

a= .348 


0= .437 

LLS = 407.6 

g=-.36 

g=-. 6 

1%=87 61 

Sm = 0.0207 

1%=12,820 

1%=11,097 


UVAS CREEK 
above Reservoir 

D.A. = 21.0 

M=3.318 


M= 3.036 

M.A.P. = 39 

a= .364 


0= .416 

LLS = 220.5 

g=-.62 

g=-. 6 

1%=6578 

Sm = .0234 

1%=997 0 

1%=10,053 


WEST BRANCH 

SOQUEL CREEK 

D.A. = 12.2 

M=2.974 


M=2.953 

M.A.P. = 45 

a= .471 


0= .392 

LLS = 246.8 

g=-.58 

g=-. 6 

1%=4 8 97 

Sm = .0224 

1%=7394 

1%=7235 



Equations 

1% (EQN# 2) 


1 %- 10,200 


1%=29,860 


1%=3547 


1%=12,163 


1%=9432 


1%=6 996 


l%=-4 58 2 



TABLE 6 


SUMMARY 

OP REGRESSION EQUATIONS 


PEAK (cfs) 


M = .148 + .68 log A + 1.25 log P 


- 45 

cr = 1.65 P 

84 

1% = 238 A'° 


LLS 

,.23 


05 


R 

Square 


. 817 
.502 
.918 


24-HR Volume (average cfs) 

M = -.756 + .83 log A + 1.51 log P .891 

- 46 04 

Cf = 1.805 P ' D LLS' U .514 

90 64 - 15 

1% = 11.625 A* yU P*^ S m ' ±D .958 

1-Day Volume (1-Day average cfs) 


M = 

-1.039 

+ .88 

log A + 1.60 log P 

.904 

O' = 

1.752 

p-.46 

LLS' 05 

.462 

1% = 

= 8.558 

00 

cr> 

p.67 -.10 

m 

.969 


2-Day Volume (Sum of 2-Day Average cfs) 
M = -.898 + .88 log A + 1.61 log P 


cr 

1 % 


- 47 

1.872 P * 

96 

11.165 A'*° 


.04 


LLS 
.,.69 



3-Day Volume (Sum of 3-Day Average cfs) 

M = -.859 + .88 log A + 1.64 log P 

O' = 1.949 p - - 48 LLS' 04 

1% = 10.206 A' 95 P' 75 S - ' 15 

m 


.901 
.51 
. 965 


. 918 
.490 
.967 
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Standard 

Error 


.210 
.065 
. 128 

.188 

.063 

.111 

.186 

.063 

.100 

.179 

.064 

.097 


.173 

.068 

.102 




Small Basins 


In use, the regional regression equations would be applied 
to small tributary basins and then combined with adjacent basins 
and routed until the entire watershed has been included. Since 
the application would be mostly for smaller basins the 1% regres 
sion was tested excluding all stream gages with greater than 30 
square miles of drainage. The results with 14 stations are: 



R 

Standard 


Square 

Error 

A, P, Sm 

.804 

.162 

A, P 

.802 

.155 

A 

.782 

.155 

the regression equation 

is: 


(EQN #13) Q = 175 A* 

I/O 

76 p .36 


This equation should be 

used for checking 

only. 
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DETERMINATION OF SKEW COEFFICIENT 


Previously, the discussion included the determination of 
the two statistics, mean and standard deviation. The third 
statistic, needed to fully define a frequency curve, is the 
coefficient of skewness. This statistic is sensitive to the 
number of years of record and hence more difficult to calculate. 
Matalas, Nicholas and Benson found that the standard error of 
the skew coefficient can go as high as 0.45 with 25 years of 
record. This standard error value increases to 0.70 with only 
10 years of record and drops to only 0.25 with 100 years of 
record. For fifty years of record, the standard error is 0.35. 
In other words, for small samples the dispersion of the 
calculated skew coefficient around the skew obtained from the 
population prohibits casual application of this parameter. 

For this reason and because the average length of records 
in this area is less than 25 years, it was decided to use a 
regionalized skew coefficient where a significant improvement on 
the standard error could be achieved. This regional skew was 
determined by computing the arithmetic and weighted means 
of the skew factors for the recorded, and recorded and estimated 
data. Table 7 shows these values for various durations. The 
final regional skew factors selected for future studies were 
-0.6, -0.64, -0.66, -0.68, -0.69, -0.70, -0.72, and -0.74 for 
the peak,1-day, 2-day, 3-day, 4-day, 5-day, 10-day, and 15-day 
volumes, respectively. 

These values match the framework of the regional skew fac¬ 
tors as outlined by the W.R.C. It was contemplated at one stage 
of this study that isolines of equal skews in the District would 



1 tn | tn 


TABLE 7 


REGIONAL FREQUENCY ANALYSIS 


s. Skewness 

FROM RECORDED DATA ALONE 

N. Coeff. 



Statistics's. 
Criteria ^S. 

g 

g w 

Peak 

-0.991 

-0.980 

1-Day 

-0.799 

-0.789 

2-Day 

-0.834 

-0.813 

3-Day 

-0.857 

-0.850 

4-Day 

-0.863 

-0.861 

5-Day 

-0.844 

-0.848 

10-Day 

-0.840 

-0.858 

15-Day 

-0.827 

-0.856 


s. Skewness 

^s. Coeff. 

Statistic ss^ 
Criteria ^s. 

FROM RECORDED AND ESTIMATED DATA 

g 

^w 

Peak 

-0.600 

-0.612 

1-Day 

-0.640 

-0.643 

2-Day 

-0.630 

-0.664 

3-Day 

-0.680 

-0.679 

4-Day 

-0.690 

-0.687 

5-Day 

-0.700 

-0.698 

10-Day 

-0.720 

-0.723 

15-Day 

-0.74Q 

-0.738 


- Arithmetic Mean 
- Weighted Mean 
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be valuable to the determination of a regional skew factor. But 
after the calculated skew coefficients were plotted, a trend was 
not definable at all. So it was decided that until more data 
points are available, such an attempt could yield misleading 
results. 

The regional information should be supplemented by site 
specific information. Therefore the W.R.C. method of obtaining 
an adopted skew by weighing the regional skew with the skew 
obtained from recorded will be utilized. 

Due to the difficulty of determining skew with the existing 
records the next updating of the regression equations should 
concentrate on the skew coefficient. 

The following Table 8 shows the calculation of the various 
skew factors. A close look at the table reveals that the most 
likely skewness to include most of the information is the skew 
obtained from the averaged adopted skews (-.59 or -.60). This 
skew factor satisfied the requirement of the W.R.C. as applied 
to gaged locations and satisfies the criteria of using a 
weighted skew coefficient rather than the calculated one. 
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TABLE 8 


Adopted Regional Coefficient of Skewness 
(Calculated Regional Coefficient of Skew = -.60) 


Recorded, and Estimated Data Recorded Data Only 

Adopted! i Adopted 



Skew 

1 

Ne 

1 

g * 

| Skew 

N 

9 W * 

Aptos 

-.366 

58.3 

-.50 

-1.125 

17 

-.6 

Bodfish 

-.230 

54.0 

-.46 

-.411 

16 

-.6 

Branciforte 

-.419 

52.6 

-.53 

-.230 

19 

-.6 

Calabazas 

-.136 

55.1 

-.41 

+ .138 

35 

-.50 

Corralitos 

-.303 

56.9 

-.47 

-.920 

15 

6 

Coyote-Gilroy 

-.767 

58.7 

-.68 

-1.676 

15 

-.6 

Fisher 

-.432 

55.7 

-.53 

-.360 

23 

-. 6 

Los Gatos 

-.139 

55.5 

-.41 

-.566 

15 

-.6 

Matadero 

-.923 

56.8 

-.74 

-1.099 

22 

-.6 

Permanente 

-1.470 

57.8 

-.98 

-1.902 

23 

-.6 

Redwood 

-1.085 

58.6 

-.82 

-1.610 

16 

-.6 

Ross 

-.152 

51.7 

-.44 

-.427 

33 

-.58 

San Francisquito 

-1.030 

53.5 

-.76 

-1.805 

35 

-.76 

San Lorenzo Creek 

-.717 

53.3 

-.64 

-.456 

29 

-.59 

San Lorenzo River 

-.572 

55.7 

-.59 

-.348 

23 

-.6 

Saratoga 

.057 

52.2 

-.36 

+ .254 

42 

-.41 

Soquel 

-.530 

47.9 

-.58 

-.970 

24 

-.6 

Uvas MH 

-.359 

56.1 

-.50 

-.392 

27 

-.59 

Uvas Reservoir 

-.619 

41.4 

-.60 

-1.579 

14 

-.6 

West Br. Soquel 

-.575 

60.3 

-.59 

-.820 

14 

-.6 



Weighted = 

-.58 

Weighted = 

-.59 

*Weighted skew, g w = 

(N-25) Recorded + (1 - 
75 

N-25) 

75 

Regional 

(-. 6 ) 



Where N ^ 5 . 25 years. 
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CONFIDENCE LIMITS 


The accuracy of results obtained from equations derived 
from hydrologic data are limited by errors inherent in the data 
itself, and by the choice of a distribution function that defines 
the frequency of that data. Therefore the equations should be 
complemented by an associated standard error function, or an 
index of reliability called the confidence band. Similar to the 
hydrologic processes from which the equations were derived, the 
standard errors of the dependent variables of the equations 
(mean and standard deviation), follow certain distributions. 

In the case of the mean and standard deviation, the best 
suited distributions are the student "t" and the chi-square 
respectively. The choice of deviates from these distributions 
depends on the choice of a probability of occurrence. This 
probability is then called the confidence limit of our statisti¬ 
cal results. 

An 80% confidence limit or the 80% level of significance 
means that, regardless of whatever sample is chosen from the 
same population, there is an 80% certainty that the resultant 
value will be equal to or less than the calculated upper confi¬ 
dence limit, and equal to or more than the calculated lower con¬ 
fidence limit. The upper 80% limit could also be called the 
upper 10% confidence limit, which is similar in that there is 
only a 10% chance that the value will be greater than the calcu¬ 
lated upper 10% confidence limit value. 

Utilizing the known characteristics of the student "t" and 
chi-square distributions, the standard error of the 
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means e m (standard deviation of the means of many samples from 
the same population) is calculated by the following formula: 



Where a = Standard deviation. 

N = Number of years . 

Similarly the standard error of the standard deviation e 
(standard deviation of the standard deviations of many samples 
from the same population) is calculated as follows: 

2 

c = £ _ 

a . - 

V 2 N 

e 

The combined error of the mean and standard deviation is 
distributed according to a non-central "t" distribution which 
was adopted and used to adjust the already adopted frequency 
factor (k) as per the following formula: 

Kr 

Q = M + k ( /K) a 

Kc 

Where ( /K) is the adjusting ratio from a normal distri¬ 

bution (k) . 

As presented in the W.R.C. Bulletin #17, the same non¬ 
central "t" distribution is used and tables of Kc/K are avail¬ 
able. The question is what are the equivalent years of record 
(Ne) that should be attributed to the regression results to 
determine the non-central deviates? 

The number of gaging stations and their years of record 
are both factors which should be used in defining the equivalent 
number of years (station-years) used for the determination of 
a confidence band. 
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Assuming that annual flood series are correlated among 
stations, the equivalent number of stations, n , is given by the 
following formula: 

n = _n_ 

e 1 + r (n-1) 


n = number of stations 

r = average correlation coefficient of all 
correlation coefficients for a matrix of 
n (n-1) /2 different values. 

The equivalent number of station-years is, therefore, equal 


to: 


N = 

o 


No (n e ) 


No(n) _ 

1 + r (n-1) 


where; 

No = the average number of years of record of n stations. 

From the present correlation studies the average correlation 

coefficient r = .785 and the number of stations used in the 

study was 20. Therefore, the equivalent number of stations-(n ) 

6 

for the regional study is equal to: 

20 

n e = l + .785 (19) • 


Knowing the average number of equivalent years in each 

station. No = 55, the equivalent number of station-years, there 

fore, would be equal to: 

N = No x n 
e e 

= 55 x 1.256 

= 69.08 Say 69 years 
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Table 9 shows the appropriate ratios to be multiplied by 
the calculated deviates of a near normal frequency distribution 
to obtain the non-central deviates of a preassigned confidence 
band probability. These could be used in the computation of the 
upper 80% (10%) confidence band as obtained from the W.R.C. 
Bulletin #17 (the sample size = 69 years). 

TABLE 9* 

Probability k^/k 


. 001 

1.14 

.0 05 

1.14 

.010 

1.14 

.020 

1.15 

.050 

1.16 

. 100 

1.18 

. 200 

1.23 

. 400 

1.65 


k could then be calculated if the deviate is obtained from a 
c 

near normal distribution. 


*This table was adapted from the W.R.C. Bulletin #17. 


APPLICATION OF REGIONAL EQUATIONS 


General 

The formulas can be applied to gaged and ungaged water¬ 
sheds. It should be strongly emphasized that all available 
information must be evaluated before a decision on a final 
flow rate is made. Careless application of the equations could 
produce misleading results. Hydrology is after all as much an 
art as a science and numerical results must be thoroughly 
backed with experience and judgment. For ungaged watersheds 
the mean and standard deviation obtained from regression equa¬ 
tions are considered to contain the best available information 
and are used directly. For gaged watersheds though, the site 
specific results of the frequency analysis of recorded data 
could be weighted with the results from the regional equations. 
The weights to be used are those of the respective equivalent 
years of record obtained from the standard errors of the various 
statistics used, as given below. 

Equivalent Years of Record 

For the regression equations of the peaks, the standard 
errors of the mean and standard deviation are numerically known 
(Table 6 ). Hence the equivalent years of record for the 
mean are obtained from these standard errors as calculated by 
the following formula: 


2 
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Where e m , is the standard error of mean 
a is the standard deviation 

N e m is equivalent years of record of the mean 


Similarily, the .equivalent years of record of the standard 
deviation 


N 


e 


a 



2 


N Q is then multiplied by its own statistic. The statistics 

of recorded data is similarly multiplied by the number of years 

of record (N). The results are then added and divided by the 

summation of the N and N. 

e 

Consequently, the weighted mean of regional and recorded 
results is: 


M = 


M N + M . N 
~r ‘ e, m _s_ 

N m + N 
e ,m 


Where M is the weighted mean 

M r is the mean obtained from regression 

M is the mean obtained from recorded data 
s 


Similarly, 
a. 


- N 


*L- 

e ,g + a s 


N 


N 


e, 0 + N 


Where a, a and 0 are the weighted standard deviation, the 

3T S 

one obtained from regression and one from recorded data respectively. 


Sensitivity Analysis 

The values resulting from the application of the regional 
regression equations at the 20 base stations together with the 
statistical values of the recorded date are summarized in Table 
10 . 
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TABLE 10 


1976 REGIONAL EQUATIONS 


Base 




RECORDED 

DATA 


SE, Hear 

i - . 

210 

SE,o = 

.065 

WEIGHTED VALUES 

Weighted 



1976 

1976 

U.S.G.S. 

STATION 

Mean 

O' 

_2_ 

N 

Mean 


_ 

NE 

Mean 

NE 

o' 

Mean 

& 

g 

Values 

1% 

50% 


Regional Regional 
1% M,d 

Graphi- 
ca I ly 

Aptos 

2.616 

.583 

-1.125 

17 

2.832 

436 

-.6 

4.3 

22.5 

2.660 

.499 

-.6 

3,964 

512 


4,352 

4,484 

4,400 

Bodfish 

2.445 

.479 

- .411 

1C 

2.585 

438 

-.6 

4.4 

22.7 

2.475 

.455 

-.6 

2,140 

331 


2,738 

2,561 

2,400 

Brand forte 

3.176 

. 381 

- .230 

19 

2.992 

419 

-.6 

4.0 

20.8 

3.144 

,401 

-.6 

7,904 

1527 


5,987 

6,021 

6,000 

Calabazas 

2.485 

.431 

+ .138 

35 

2.332 

493 

-.6 

5.5 

28.8 

2.464 

,459 

-.50 

2,298 



1,787 

1,814 

1,700 

Corralitos 

2.639 

.538 

- .920 

15 

2.848 

392 

-.6 

3.5 

18.2 

2.679 

.458 

-.6 

3,468 

530 


3,954 

3,845 

4,200 

Coyote-Gilroy 

3.334 

.612 

-1.676 

15 

3.259 

583 

-.6 

7.7 

40.2 

3.309 

.591 

-.6 

26,307 

2331 


25,004 

22,648 

22,000 

Fisher 

2.337 

.523 

- .360 

23 

2.584 

.591 

-.6 

7.9 

41.3 

2.400 

.567 

-.6 

2,923 

286 


4,690 

4,955 

4,700 

Los Gatos 

3.261 

.441 

- .566 

15 

3.269 

.415 

-.6 

3.9 

20.4 

3.263 

.426 

-.6 

11,585 

2019 


11,926 

11,199 

12,000 

Matadero 

2.445 

.455 

-1.099 

22 

2.332 

.566 

-.6 

7.3 

37.9 

2.417 

. 525 

-.6 

2,535 

294 


2,434 

2,489 

2,500 

Permanente 

2.255 

.767 

-1.902 

23 

2.618 

,559 

-.6 

7.1 

37.0 

2.341 

.63 3 

-.6 

3,486 

254 


4,803 

4,666 

4, TOC 

Redwood 

2.310 

.403 

-1.610 

16 

1.92 3 

,503 

-.6 

5.7 

29.9 

2.208 

.468 

-.6 

1,224 

180 


760 

739 

570 

Ross 

2.195 

.392 

- .427 

33 

2.206 

431 

-.6 

4.2 

22.0 

2.196 

.408 

-.58 

931 



1,181 

1,038 

1,200 

Sail Francisquito 

3.155 

.531 

-1.805 

35 

3.083 

,485 

-.6 

5.3 

27.8 

3.146 

.511 

-.76 

11,127 



10,826 

9,880 

10,000 

San Lorenzo Cr. 

2.970 

.548 

- .456 

29 

2.944 

. 527 

-.6 

6.3 

32.9 

2.965 

.537 

-.59 

9,520 



10,200 

8,605 

9,500 

San Lorenzo Riv. 

3.778 

.452 

- .348 

23 

3.640 

.428 

-.6 

4.2 

21.7 

3.757 

.440 

-.6 

38,388 

6318 


29,860 

27,838 

29,000 

Saratoga 

2.621 

.379 

+ .254 

42 

2.832 

. 378 

-.6 

3.2 

16.9 

2.636 

. 379 

-.41 

2,525 



3,547 

3,488 

3,800 

ooquel 

5.345 

. 461 

- .9/0 

24 

3.241 

.446 

—. 6 

4.5 

23.5 

3.329 

.454 

-.6 

15,224 

2 366 


12,163 

12,008 

12,000 

Uvas MB 

3.375 

.402 

- .392 

27 

3.121 

.437 

-.6 

4.3 

22.6 

3. 340 

.418 

-.59 

13,464 



9,432 

8,761 

9,000 

Uvas Res. 

3.384 

.303 

-1.579 

14 

3.036 

.416 

-.6 

3.9 

20.5 

3. 308 

.370 

-.6 

10,083 

2211 


6,996 

6,578 

6,800 

West Br. Soquel 

2.956 

. 532 

- .820 

14 

2.953 

.392 

-.6 

3.5 

18.2 

2.955 

.453 

-.6 

6,407 

1000 


4,582 

4,897 

4,600 

NE - « 2 

vr — 

2 


(N-25) 



v;_25> 








+ 2i 


, 352 

+ = - 24,4 33 

He - (SE, 2 

— 

O 

g = 

75 

Recorded 

+ li 

75 


i + L 

— 18 

, 281 

+ A = 

26,9 

89 

= 27, 


2 (SE) ^ 




neyiuuai — 

hi 

= 31 

,979 

h! = 

38,189 

hi 

= 38 

,428 

!A j = 37,189 

NE, lca n = cr" 2 

- 

0 

O'" 







A 2 

= 124,450 

,427 

A ^ _ 

165,325,2 

99 

A 2 

= 216,647 

,948 

a 5 = 164,027,149 

(. 210) 2 


2(.065) 2 



















The best estimate of the desired flowrates is the weighted 


average of the value obtained from the correlation and extension 

of data and the value computed from the regional equations. The 

applicability of the various procedural possibilities is measured 

by the least square of the deviations from those average values. 

2 6 

1976 Regional Eqn. (1%) A = 7.7 x 10 

2 6 

U.S.G.S. Eqn. A = 25.5 x 10 

2 6 

1976 Regional Eqn. (M ) A = 29.1 x 10 

2 6 

Weicrhted Reaional and Recorded A = 103.6 x 10 


Weighted Regional and Recorded 
1972 Regional Eqn. 


A 2 = 519.0 x 10 6 


It appears that in general the 197b Regional Equations 
should be preferred. However, there are, most certainly, specific 
cases where this may not be true. Appendix E is a graphical 
representation of those results. 

Volumes Equations 

Designation of design floods includes the designation 
of peaks and their related volumes. A regression analysis 
study was done to determine the 24 Hr., 1-day, 2-day, and 3-day 
volumes and their frequency of occurrence. This study is explained 
in detail in Appendix D. 

To combine a flood peak that has an assigned frequency of 
occurrence with a flood volume that has another frequency of 
occurrence creates problems associated with the compounded 
probabilities as per the following formula. 

p (p,v 7 ) = P (P) x p (v 7 ) = p (F) 

X P 7 p 

Where P, is the probability of the peak and volumes 

,V /p ' occurring together 


is the probability of the peak 
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(V, ) 
x p 


is the probability of the volume given 
the peaks. 


P 


The value of P (V, ) would be around unity if the volumes 

/ P 

associated with the peaks are selected. 

Range of Application 

The areas of gaged basins used for the analysis range from 
0.8 square mile to 196 square miles and the basin average mean- 
annual precipitation ranging from 17 inches to 50 inches. This 
is an indication of the range of validity of the regional equations. 

Prom studying the distribution of the M.A.P.'s and Areas, it 
appears that the most applicable range is less than 50 square miles 
of area. Experience with small basins indicates that additional 
care is needed when small basins are encountered. Applicability 
of the mean annual precipitation is uniform for the full range 
of values encountered. 

In summary, the regression equations are best applied with 
the following limitations: 

1. Mean annual precipitation ranges from 12" to 56". 

2. The elevation range is from sea level to an elevation 
of 4000 feet, central pacific coast. 

3. Average slope of main channel ranges from .002 to .05 
ft./ft. 

4. Areas ranging from 1 to 200 square miles although for 
areas less than 30 square miles, the results should be checked 
with EQN #13. 
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APPENDIX D* 
Volumes Equations 


* Appendexes A, B, and C are listed under the Table of Contents 
and are available upon request. 
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Regression for 24-Hour Volume 


Maximum annual 1-day streamflows (usually corresponding to 
maximum annual peak) were extracted from the data and converted 
to 24-hour volumes through the relationship 

log 24 = .23 log Q peak + .77 log Q daily 

developed by the Corps of Engineers. The dependent variables 
(mean, standard deviation and 1% flow (g = -.64) of this recorded 
and reconstituted 24-hour data were calculated and listed in 
Table 1. 

The same 20 streamflow stations and 4 independent variables 
used in the analysis for the peak were used for the 24-hour volume. 

The final resulting regression equations for the 24-hour 
volumes are: 

R Standard 
Square Error 


M = 

-.756 + .83 

log A + 1.51 log P 

.891 

.188 

c = 

1.805 p"* 46 

LLS' 04 

.514 

. 063 

1% = 

90 

11.625 A^ u 

p.64 g -.15 
m 

.958 

.111 


The results of the application of these equations are com¬ 
pared with the results of the statistical analysis of recorded 
data at same location. Table 2 shows this comparison. 
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TABLE 1 


REGIONAL REGRESSION ANALYSIS 
STREAM GAGE STATISTICS 
24-HOUR VOLUME 



Mean 

Standard 

Deviation 

1% 24-Hr. Volume 
Avg. cfs 
(g = -.64) 

Aptos Creek* 

2 .2944 

.5062 

1701 

Bodfish Creek* 

2 .0698 

.5221 

1086 

Branciforte Creek* 

2 .8398 

.3562 

3153 

Calabazas Creek* 
SCVWD #31 

2 .0109 

.4083 

584 

Cedar Creek 

2.0676 

.8326 

4054 

Corralitos Creek* 

2.3573 

.4714 

1696 

Coyote Creek* 
near Gilroy 

3 .1455 

.5344 

13619 

Coyote Creek 
at Madrone 

3 .2054 

.7603 

40922 

Dry Creek 

1.8265 

.7370 

1549 

Fisher Creek* 

SCVWD #10 

1.9832 

.7158 

2030 

Hale Creek 

SCVWD #33 

1.5157 

.6124 

445 

Los Animas Creek 
SCVWD #56 

2 .0411 

.7186 

2347 

Los Gatos Creek* 

3 .0709 

.4106 

6769 

Matadero Creek* 

1.9820 

.4823 

749 

Metcalf Creek 

SCVWD #40 

.3890 

.3328 

10 

Pacheco Creek 

2 .7837 

.9588 

36097 

Packwood Creek 

SCVWD #57 

1.3202 

.7330 

474 

Permanente Creek* 
SCVWD #34 

1.8783 

.6542 

1226 

Redwood Creek * 

1.7585 

.4554 

399 


*The final 20 stations used for the peak regression analysis. 
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TABLE 1 


REGIONAL REGRESSION ANALYSIS (continued) 



STREAM GAGE 
24-HOUR 

STATISTICS 

VOLUME 



Mean 

Standard 

Deviation 

1% 24-Hr. Volume 
Avg. cfs 
(q = _ .64) 

Ross Creek* 

SCVWD #21 

1.6695 

.4587 

330 

San Francisquito* 

2 .7346 

.5509 

5672 

San Lorenzo Creek* 
at Hayward 

2 .6553 

.5718 

5166 

San Lorenzo River* 

3 .5286 

.3927 

17992 

San Tomas Creek 
SCVWD #29 

1 .7655 

.6374 

880 

Saratoga Creek* 
at Saratoga 

2 .3225 

.3506 

936 

Soquel Creek* 

3 .0438 

.4052 

6215 

Upper Permanente 
SCVWD #32 

1.5518 

.3771 

178 

Uvas Creek* 

at Morgan Hill 

3 .1012 

.3579 

5798 

Uvas Creek* above 
Uvas Reservoir 

2 .9440 

.3974 

4777 

West Branch 

Soquel Creek* 

2 .6144 

.4505 

2804 

Wildcat Creek 

SCVWD #30 

2 .0497 

.4211 

674 

Zayante Creek 

2 .5392 

.4974 

2880 


*The final 20 stations used for the peak regression analysis. 
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TABLE 2 

COMPARISON OF RESULTS OF PRELIMINARY EQUATIONS 
24-HOUR VOLUME 




Gage* 

Statistics 

Avg. c f s 

1976 STUDY 

24 HR. AVG. CFS. 


Station 

M.cr 
(q=- .64) 

1 % 

1) 

APTOS CREEK 

M=2.294 
&= .506 
1%=1701 

M=2.496 
.435 
1%= 1997 

1%=1920 

2) 

BODFISH CREEK 

M=2.070 
c£= .522 
1%=1086 

M=2.196 
&= .445 
1%= 1043 

1%=1037 

3) 

BRANCIFORTE CREEK 

M=2.840 
.356 

1 %=3153 

M=2.691 
o^ .417 
1%= 2899 

1%= 2 94 9 

4) 

CALABAZAS CREEK 

SCVWD #31 

M=2.Oil 
.408 
1%=584 

M=l.888 
.499 
1%= 649 

1%=616 

5) 

CORRALITOS CREEK 

M=2.357 
.471 
1%=1696 

M=2.514 
cr^ .396 
1%= 1762 

1%=1675 

6) 

COYOTE CREEK 
near Gilroy 

M=3.146 
.534 
1%=13619 

M=3.019 
.571 

1%=11,912 

1%= 11,482 

7) 

FISHER CREEK 

SCVWD #10 

M=1.983 
o*= .716 
1%=2030 

M=2.197 
.588 
1%= 1927 

1%= 220 9 

8) 

LOS GATOS CREEK 

M=3.071 
c^ .411 
1%=6769 

M=3.027 
.411 

1%= 6126 

1%= 5977 

9) 

MATADERO CREEK 

M=1.982 
.482 

1 %=749 

M=1.889 
CJ4: .571 
1%= 883 

1 %=866 ’ 

10) 

PERMANENTE CREEK 
SCVWD #34 

M=1.878 
.654 
1%=1226 

M=2.238 
.560 
1%= 1875 

1%= 17 37 


*24-Hour flows (from 1-day conversion log Q 24 = .23 log Qp ea |, 

daily^ 


+ .77 log Q 



TABLE 2 (continued 

COMPARISON OF RESULTS OF PRELIMINARY EQUATIONS 
24-HOUR VOLUME 


Station 


Gage* 

Statistics 
Avg. c f s 


1976 STUDY 
24 HR. AVG. CFS. 


M.cr' 

(g=--64 ). 


1 % 


11) REDWOOD CREEK 


M=1.759 
0= .455 
1%=399 


M= 1.391 
.520 
1%= 225 


1%= 23 0 


12) ROSS CREEK 
SCVWD #21 


M=1•670 
.459 
1%=330 


M=1.734 
0^= .447 
1%= 363 


1 %=387 


13) SAN FRANCISQUITO 
CREEK 


M=2.735 
<f= .551 
1%=5672 


M=2.802 
.481 

1%= 4914 1%= 5247 


14) SAN LORENZO CREEK 
at Hayward 


M=2.655 
c#= .572 
1%=5166 


M=2.635 
.527 

1%= 4073 1 %=4346 


15) SAN LORENZO RIVER 


M=3.529 
&= .393 
1%=17992 


M=3.480 
.416 

1 %= 17,783 1 %= 20,160 


16) SARATOGA CREEK 
at Saratoga 


M=2.323 
c^ .351 
1%=936 


M=2.495 
cr*= .382 
1%=1593 


1%=1468 


17) SOQUEL CREEK 


M=3.044 
0= .405 
1%=6215 


M= 2.995 
.439 
1 %=6397 


1%= 64 28 


18) UVAS CREEK 

at Morgan Hill 


M=3.101 
O'— .3 58 
1%=57 98 


M=2.849 
.434 

1%= 4480 1 %=4549 


19) UVAS CREEK 

above Reservoir 


M=2.944 
&= .397 
1%=4777 


M=2.744 
c# .415 

1%= 3252 1%=3298 


2 0) WEST BRANCH 
SOQUEL CREEK 


M=2.614 
o^ .451 
1%=2804 


M= 2.642 
.391 

1%= 2315 1%=2232 


*24-Hour flows (from 1-day conversion log Q 

+ .77 log Q dally ) 


24 


.23 log Q 


peak 
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1-Day, 2-Day, and 3-Day Volumes 

Tables 3, 4 and 5 are the statistics obtained from 
recorded and estimated data for the maximum 1-day, 2-day, and 
3-day volumes. These are the dependent variables used in the 
regression. 

The independent variables are the same used for the peaks 
and 24-hour volumes. 

A summary of the resultant regression equations and their 
level of significance are shown in Table 6 of the main text. 
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TABLE 3 


REGIONAL REGRESSION ANALYSIS 
STREAM GAGE STATISTICS 
1-DAY AVERAGE CFS 



Mean 

Standard 

Deviation 

1% 1-Day Volume 
(g = -.64)** 

Aptos Creek* 

2.197 

.508 

1370 

Bodfish Creek* 

1.957 

.505 

778 

Branciforte Creek* 

2.753 

.352 

2536 

Calabazas Creek* 
SCVWD #31 

1.872 

.436 

477 

Cedar Creek 

1.980 

.837 

3376 

Corralitos Creek* 

2.280 

.470 

1410 

Coyote Creek* 
near Gilroy 

3.080 

.544 

12201 

Coyote Creek 
at Madrone 

3.124 

.768 

35059 

Dry Creek 

1.714 

.736 

1190 

Fisher Creek* 

SCVWD #10 

1.902 

.684 

1470 

Hale Creek 

SCVWD #33 

1.320 

.799 

628 

Los Animas Creek 
SCVWD #56 

1.953 

.720 

1927 

Los Gatos Creek* 

3.012 

.414 

5996 

Matadero Creek* 

1.841 

.472 

518 

Pacheco Creek 

2.757 

.911 

27692 

Packwood Creek 

SCVWD #57 

1.180 

.833 

526 

Permanente Creek* 
SCVWD #34 

1.737 

.707 

1109 

Redwood Creek* 

1.597 

.447 

265 


*The final 20 stations used for the peak regression analysis 

**For skew g = -.64 K = 1.850 




TABLE 3 (Continued) 


REGIONAL REGRESSION ANALYSIS 
STREAM GAGE STATISTICS 
1-DAY AVERAGE CFS 



Mean 

Standard 

Deviation 

1% 1-Day Volume 
(g = -.64)** 

Ross Creek* 

SCVWD #21 

1.516 

.452 

225 

San Francisquito* 

2.642 

.561 

4785 

San Lorenzo Creek* 
at Hayward 

2.556 

.566 

4009 

San Lorenzo River* 

3.470 

.397 

16012 

San Tomas Creek 
SCVWD #29 

1.623 

.702 

835 

Saratoga Creek* 
at Saratoga 

2.240 

.358 

798 

Soquel Creek* 

2.969 

.417 

5501 

Upper Permanente 
SCVWD #32 

1.455 

.390 

150 

Uvas Creek* 

at Morgan Hill 

3.015 

.359 

4777 

Uvas Creek* above 
Uvas Reservoir 

2.832 

.397 

3685 

West Branch 

Soquel Creek* 

2.507 

.456 

2242 

Wildcat Creek 

SCVWD #30 

1.915 

.438 

531 

Zayante Creek 

2.454 

.513 

2529 


*The final 20 stations used for the peak regression analysis 

**For skew g = .64 K = 1.850 
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TABLE 4 


REGIONAL REGRESSION ANALYSIS 
STREAM GAGE STATISTICS 
SUM OF 2-DAY AVERAGE CFS 




Standard 

1% 2-Day Volume 


Mean 

Deviation 

(g = -.66)** 

Aptos Creek* 

2.372 

.474 

1747 

Bodfish Creek* 

2.124 

. 522 

1208 

Branciforte Creek* 

2.917 

.344 

3536 

Calabazas Creek* 

2.044 

.425 

667 

SCVWD #31 

Cedar Creek 

2.156 

.755 

3484 

Corralitos Creek* 

2.460 

.461 

2025 

Coyote Creek* 

3.278 

.516 

16802 

near Gilroy 

Coyote Creek 

3.315 

.710 

41549 

at Madrone 

Dry Creek 

1.883 

.703 

1492 

Fisher Creek* 

2.070 

.715 

2414 

SCVWD #10 

Hale Creek 

1.474 

.808 

907 

SCVWD #33 

Los Animas Creek 

2.109 

.709 

2575 

SCVWD #56 

Los Gatos Creek* 

3.190 

.398 

8332 

Matadero Creek* 

1.98 9 

.474 

723 

Pacheco Creek 

2.937 

.898 

38523 

Packwood Creek 

1.377 

.752 

572 

SCVWD #57 

Permanente Creek* 

1.908 

.714 

1655 

SCVWD #34 

Redwood Creek* 

1.736 

.444 

356 


*The final 20 stations used for the peak regression analysis. 

**For skew g = -.66 K = 1.836 
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TABLE 4 (Continued) 


REGIONAL REGRESSION ANALYSIS 
STREAM GAGE STATISTICS 
SUM OF 2-DAY AVERAGE CFS 



Mean 

Standard 

Deviation 

1% 2-Day Volume 
(q = -.66)** 

Ross Creek* 

SCVWD #21 

1.700 

.481 

383 

San Francisquito* 

2.813 

.573 

7329 

San Lorenzo Creek* 
at Hayward 

2.721 

.550 

5380 

San Lorenzo River* 

3.636 

.391 

22588 

San Tomas Creek 
SCVWD #29 

1.811 

.690 

1196 

Saratoga Creek* 
at Saratoga 

2.443 

.350 

1217 

Soquel Creek* 

3.126 

.428 

8162 

Upper Permanente 
SCVWD #32 

1.677 

.451 

320 

Uvas Creek* 

at Morgan Hill 

3.181 

.375 

7404 

Uvas Creek* above 
Uvas Reservoir 

2.977 

.434 

5940 

West Branch 

Soquel Creek* 

2.672 

.440 

3019 

Wildcat Creek 

SCVWD #30 

2.085 

.423 

727 

Zayante Creek 

2.587 

.464 

2747 


*The final 20 stations used for the peak regression analysis. 

**For skew g = -.66 K = 1.836 
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TABLE 5 


REGIONAL REGRESSION ANALYSIS 
STREAM GAGE STATISTICS 
SUM OF 3-DAY AVERAGE CFS 



Mean 

Standard 

Deviation 

1% 3-Day Volume 
(cr = -.68)** 

Aptos Creek* 

2.477 

.454 

2012 

Bodfish Creek* 

2,219 

.533 

1547 

Branciforte Creek* 

3.001 

.340 

4170 

Calabazas Creek* 
SCVWD #31 

2.119 

.418 

759 

Cedar Creek 

2.252 

.706 

3448 

Corralitos Creek* 

2.573 

.455 

2520 

Coyote Creek* 
near Gilroy 

3.380 

.499 

19439 

Coyote Creek 
at Madrone 

3.405 

.674 

42889 

Dry Creek 

1.966 

.682 

1614 

Fisher Creek* 

SCVWD #10 

2.176 

.732 

3228 

Hale Creek 

SCVWD #33 

1.541 

.814 

1055 

Los Animas Creek 
SCVWD #56 

2.162 

.701 

2745 

Los Gatos Creek* 

3.285 

.389 

9848 

Matadero Creek* 

2.051 

.476 

827 

Pacheco Creek 

3.034 

.890 

45153 

Packwood Creek 

SCVWD #57 

1.469 

.700 

554 

Permanente Creek* 
SCVWD #34 

1.991 

.718 

1988 

Redwood Creek* 

1.821 

.442 

422 


*The final 20 stations used for the peak regression analysis. 

**For skew g = -.68 K = 1.821 
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TABLE .5 (Continued) 


REGIONAL REGRESSION ANALYSIS 
STREAM GAGE STATISTICS 
SUM OF 3-DAY AVERAGE CFS 



Mean 

Standard 

Deviation 

1% 3-Day Volume 
(g = -.68)** 

Ross Creek* 

SCVWD #21 

1.784 

.499 

493 

San Francisquito* 

2.910 

.581 

9290 

San Lorenzo Creek* 
at Hayward 

2.809 

.541 

6225 

San Lorenzo River* 

3.729 

.388 

27261 

San Tomas Creek 
SCVWD #29 

1.910 

.682 

1419 

Saratoga Creek* 
at Saratoga 

2.555 

.344 

1518 

Soquel Creek* 

3.224 

.434 

10335 

Upper Permanente 
SCVWD #32 

1.794 

.488 

481 

Uvas Creek* 

at Morgan Hill 

3.272 

.384 

9359 

Uvas Creek* above 
Uvas Reservoir 

3.037 

.457 

7399 

West Branch 

Soquel Creek* 

2.774 

.430 

3606 

Wildcat Creek 

SCVWD #30 

2.148 

.414 

798 

Zayante Creek 

2.689 

.437 

3053 


*The final 20 stations used for the peak regression analysis. 

**For skew g = -.68 K = 1.821 
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APPENDIX E 
Frequency Curves 
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